Ch.12 Orthogonal Projection


Onto a Line


Imagine "walking" from the tip of $\vec{v}$ to the line along an orthogonal direction. Since the line is the span of some vector $\vec{s}$, that is, $l=\{c\cdot\vec{s}\ |\ c\in\mathbb{R}\}$, we are looking for the $c_p$ such that $\vec{v}-c_p\vec{s}$ is orthogonal to the line

To solve, notice that $\vec{v}-c_p\vec{s}$ must in particular be orthogonal to $\vec{s}$ itself, so
$$\vec{s}\cdot(\vec{v}-c_p\vec{s})=0\rightarrow \vec{s}\cdot\vec{v}-c_p(\vec{s}\cdot\vec{s})=0\rightarrow c_p=\frac{\vec{s}\cdot\vec{v}}{\vec{s}\cdot\vec{s}}$$
Thus, the orthogonal projection of $\vec{v}$ onto $l=[\vec{s}]$ is
$$\text{proj}_{[\vec{s}]}(\vec{v})=\frac{\vec{v}\cdot\vec{s}}{\vec{s}\cdot\vec{s}}\cdot\vec{s}$$
This vector $\vec{w}=\text{proj}_{[\vec{s}]}(\vec{v})$ is the only vector in the line $[\vec{s}]$ such that $\vec{v}-\vec{w}$ is orthogonal to every vector in $[\vec{s}]$

Example 12.1

The projection of the $\mathbb{R}^3$ vector $\vec{v}$ onto the line $L$, where
$$\vec{v}=\begin{pmatrix}3\\1\\1\end{pmatrix}\quad L=\{c\cdot\begin{pmatrix}1\\-2\\1\end{pmatrix}\ |\ c\in\mathbb{R}\}$$
is the vector
$$\text{proj}_L(\vec{v})=\frac{\begin{pmatrix}3\\1\\1\end{pmatrix}\cdot\begin{pmatrix}1\\-2\\1\end{pmatrix}}{\begin{pmatrix}1\\-2\\1\end{pmatrix}\cdot\begin{pmatrix}1\\-2\\1\end{pmatrix}}\cdot\begin{pmatrix}1\\-2\\1\end{pmatrix}=\frac{2}{6}\begin{pmatrix}1\\-2\\1\end{pmatrix}=\begin{pmatrix}1/3\\-2/3\\1/3\end{pmatrix}$$
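As a quick numerical check of this formula, here is a sketch assuming NumPy is available (`proj_line` is a hypothetical helper name, not from the text):

```python
import numpy as np

def proj_line(v, s):
    """Orthogonal projection of v onto the line spanned by s."""
    return (v @ s) / (s @ s) * s

# Example 12.1: project (3, 1, 1) onto the span of (1, -2, 1)
v = np.array([3.0, 1.0, 1.0])
s = np.array([1.0, -2.0, 1.0])
w = proj_line(v, s)  # gives (1/3, -2/3, 1/3)

# the residual v - w is orthogonal to s, as the derivation requires
assert abs((v - w) @ s) < 1e-12
```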


Gram-Schmidt Orthogonalization

Notice how $\vec{v}$ can be decomposed into
$$\vec{v}=\text{proj}_{[\vec{s}]}(\vec{v})+\left(\vec{v}-\text{proj}_{[\vec{s}]}(\vec{v})\right)$$
These two parts are orthogonal, so they are "non-interacting"; in particular, when both are nonzero they are linearly independent

Vectors $\vec{v}_1,...,\vec{v}_k\in\mathbb{R}^n$ are mutually orthogonal if any pair of them are orthogonal,
i.e. for any $i\ne j$, $\vec{v}_i$ and $\vec{v}_j$ are orthogonal
For example, the standard basis vectors are mutually orthogonal

If the vectors in a set $\{\vec{v}_1,...,\vec{v}_k\}\subset\mathbb{R}^n$ are mutually orthogonal and nonzero, then the set is linearly independent.

Proof

Consider $c_1\vec{v}_1+\cdots+c_k\vec{v}_k=\vec{0}$. For $i\in\{1,...,k\}$, taking the dot product with $\vec{v}_i$ on both sides gives
$$\vec{v}_i\cdot(c_1\vec{v}_1+\cdots+c_k\vec{v}_k)=\vec{v}_i\cdot\vec{0}\implies c_i(\vec{v}_i\cdot\vec{v}_i)=0$$
since every cross term $c_j(\vec{v}_i\cdot\vec{v}_j)$ with $j\ne i$ vanishes by orthogonality.
Since $\vec{v}_i\ne\vec{0}$, we have $\vec{v}_i\cdot\vec{v}_i\ne0$, therefore $c_i=0$.
Since all $c_i=0$, the set is linearly independent.

A corollary: any $k$ mutually orthogonal nonzero vectors in a $k$-dimensional vector space form a basis, because any $k$ linearly independent vectors in a $k$-dimensional space form a basis.
An orthogonal basis for a vector space is a basis of mutually orthogonal vectors

Gram-Schmidt Orthogonalization

If $\langle\vec{\beta}_1,...,\vec{\beta}_k\rangle$ is a basis for a subspace of $\mathbb{R}^n$, then the vectors
$$\begin{array}{rcl} \vec{\kappa}_1&=&\vec{\beta}_1\\ \vec{\kappa}_2&=&\vec{\beta}_2-\text{proj}_{[\vec{\kappa}_1]}(\vec{\beta}_2)\\ \vec{\kappa}_3&=&\vec{\beta}_3-\text{proj}_{[\vec{\kappa}_1]}(\vec{\beta}_3)-\text{proj}_{[\vec{\kappa}_2]}(\vec{\beta}_3)\\ &\vdots\\ \vec{\kappa}_k&=&\vec{\beta}_k-\text{proj}_{[\vec{\kappa}_1]}(\vec{\beta}_k)-\cdots-\text{proj}_{[\vec{\kappa}_{k-1}]}(\vec{\beta}_k) \end{array}$$
form an orthogonal basis for the same subspace. Moreover,
$$\text{span}(\vec{\kappa}_1,...,\vec{\kappa}_i)=\text{span}(\vec{\beta}_1,...,\vec{\beta}_i)\text{ for all }i=1,...,k$$

Proof

We use induction to show that each $\vec{\kappa}_i$ is nonzero, lies in $\text{span}(\vec{\beta}_1,...,\vec{\beta}_i)$, and is orthogonal to $\vec{\kappa}_1,...,\vec{\kappa}_{i-1}$:

Case $i=1$: this is trivial
Case $i=2$: we have
$$\vec{\kappa}_2=\vec{\beta}_2-\text{proj}_{[\vec{\kappa}_1]}(\vec{\beta}_2)=\vec{\beta}_2-\frac{\vec{\beta}_2\cdot\vec{\kappa}_1}{\vec{\kappa}_1\cdot\vec{\kappa}_1}\cdot\vec{\kappa}_1=\vec{\beta}_2-\frac{\vec{\beta}_2\cdot\vec{\kappa}_1}{\vec{\kappa}_1\cdot\vec{\kappa}_1}\cdot\vec{\beta}_1$$
This is nonzero because the $\vec{\beta}$'s are linearly independent, is clearly in $\text{span}(\vec{\beta}_1,\vec{\beta}_2)$, and is orthogonal to $\vec{\kappa}_1$ because the projection residual is orthogonal
Case $i=3$: we have
$$\vec{\kappa}_3=\vec{\beta}_3-\text{proj}_{[\vec{\kappa}_1]}(\vec{\beta}_3)-\text{proj}_{[\vec{\kappa}_2]}(\vec{\beta}_3)=\vec{\beta}_3-\frac{\vec{\beta}_3\cdot\vec{\kappa}_1}{\vec{\kappa}_1\cdot\vec{\kappa}_1}\cdot\vec{\kappa}_1-\frac{\vec{\beta}_3\cdot\vec{\kappa}_2}{\vec{\kappa}_2\cdot\vec{\kappa}_2}\cdot\vec{\kappa}_2$$
$$=\vec{\beta}_3-\frac{\vec{\beta}_3\cdot\vec{\kappa}_1}{\vec{\kappa}_1\cdot\vec{\kappa}_1}\cdot\vec{\beta}_1-\frac{\vec{\beta}_3\cdot\vec{\kappa}_2}{\vec{\kappa}_2\cdot\vec{\kappa}_2}\cdot\left(\vec{\beta}_2-\frac{\vec{\beta}_2\cdot\vec{\kappa}_1}{\vec{\kappa}_1\cdot\vec{\kappa}_1}\cdot\vec{\beta}_1\right)$$
This is nonzero and in $\text{span}(\vec{\beta}_1,\vec{\beta}_2,\vec{\beta}_3)$ because the $\vec{\beta}$'s are linearly independent, and it is not hard to check that it is orthogonal to $\vec{\kappa}_1$ and $\vec{\kappa}_2$
Continue in this fashion to prove the claim for all $i=1,...,k$

Note that if $\langle\vec{\beta}_1,...,\vec{\beta}_k\rangle$ is already orthogonal, the process just gives $\vec{\kappa}_i=\vec{\beta}_i$ for $i=1,...,k$

Example 12.2

Derive an orthogonal basis $K=\langle\vec{\kappa}_1,\vec{\kappa}_2\rangle$ from the basis
$$B=\langle\begin{pmatrix}1\\2\end{pmatrix},\begin{pmatrix}1\\3\end{pmatrix}\rangle$$

First, $\vec{\kappa}_1=\vec{\beta}_1=\begin{pmatrix}1\\2\end{pmatrix}$
Then,
$$\vec{\kappa}_2=\vec{\beta}_2-\text{proj}_{[\vec{\kappa}_1]}(\vec{\beta}_2)=\begin{pmatrix}1\\3\end{pmatrix}-\frac{\begin{pmatrix}1\\3\end{pmatrix}\cdot\begin{pmatrix}1\\2\end{pmatrix}}{\begin{pmatrix}1\\2\end{pmatrix}\cdot\begin{pmatrix}1\\2\end{pmatrix}}\cdot\begin{pmatrix}1\\2\end{pmatrix}=\begin{pmatrix}1\\3\end{pmatrix}-\frac{7}{5}\begin{pmatrix}1\\2\end{pmatrix}=\begin{pmatrix}-2/5\\1/5\end{pmatrix}$$
Thus, $K=\langle\begin{pmatrix}1\\2\end{pmatrix},\begin{pmatrix}-2/5\\1/5\end{pmatrix}\rangle$
Note that because $\begin{pmatrix}1\\2\end{pmatrix}\cdot\begin{pmatrix}-2/5\\1/5\end{pmatrix}=0$, the two vectors are orthogonal

Example 12.3

Derive an orthogonal basis $K$ for
$$B=\langle\begin{pmatrix}1\\1\\2\end{pmatrix},\begin{pmatrix}-1\\2\\1\end{pmatrix},\begin{pmatrix}0\\3\\-1\end{pmatrix}\rangle$$

$$\vec{\kappa}_1=\vec{\beta}_1=\begin{pmatrix}1\\1\\2\end{pmatrix}$$
$$\vec{\kappa}_2=\vec{\beta}_2-\text{proj}_{[\vec{\kappa}_1]}(\vec{\beta}_2)=\begin{pmatrix}-1\\2\\1\end{pmatrix}-\frac{\begin{pmatrix}-1\\2\\1\end{pmatrix}\cdot\begin{pmatrix}1\\1\\2\end{pmatrix}}{\begin{pmatrix}1\\1\\2\end{pmatrix}\cdot\begin{pmatrix}1\\1\\2\end{pmatrix}}\cdot\begin{pmatrix}1\\1\\2\end{pmatrix}=\begin{pmatrix}-1\\2\\1\end{pmatrix}-\frac{1}{2}\begin{pmatrix}1\\1\\2\end{pmatrix}=\begin{pmatrix}-3/2\\3/2\\0\end{pmatrix}$$
$$\vec{\kappa}_3=\vec{\beta}_3-\text{proj}_{[\vec{\kappa}_1]}(\vec{\beta}_3)-\text{proj}_{[\vec{\kappa}_2]}(\vec{\beta}_3)=\begin{pmatrix}0\\3\\-1\end{pmatrix}-\frac{\begin{pmatrix}0\\3\\-1\end{pmatrix}\cdot\begin{pmatrix}1\\1\\2\end{pmatrix}}{\begin{pmatrix}1\\1\\2\end{pmatrix}\cdot\begin{pmatrix}1\\1\\2\end{pmatrix}}\cdot\begin{pmatrix}1\\1\\2\end{pmatrix}-\frac{\begin{pmatrix}0\\3\\-1\end{pmatrix}\cdot\begin{pmatrix}-3/2\\3/2\\0\end{pmatrix}}{\begin{pmatrix}-3/2\\3/2\\0\end{pmatrix}\cdot\begin{pmatrix}-3/2\\3/2\\0\end{pmatrix}}\cdot\begin{pmatrix}-3/2\\3/2\\0\end{pmatrix}$$
$$=\begin{pmatrix}0\\3\\-1\end{pmatrix}-\frac{1}{6}\begin{pmatrix}1\\1\\2\end{pmatrix}-\frac{9/2}{9/2}\begin{pmatrix}-3/2\\3/2\\0\end{pmatrix}=\begin{pmatrix}4/3\\4/3\\-4/3\end{pmatrix}$$
So in summary,
$$K=\langle\begin{pmatrix}1\\1\\2\end{pmatrix},\begin{pmatrix}-3/2\\3/2\\0\end{pmatrix},\begin{pmatrix}4/3\\4/3\\-4/3\end{pmatrix}\rangle$$
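The whole process can be sketched in a few lines of NumPy (an assumption; `gram_schmidt` is a hypothetical helper name, not from the text):

```python
import numpy as np

def gram_schmidt(basis):
    """Orthogonalize linearly independent vectors; no normalization."""
    kappas = []
    for beta in basis:
        kappa = beta.astype(float).copy()
        for k in kappas:
            kappa -= (beta @ k) / (k @ k) * k  # subtract proj_[k](beta)
        kappas.append(kappa)
    return kappas

# Example 12.3
B = [np.array([1.0, 1.0, 2.0]),
     np.array([-1.0, 2.0, 1.0]),
     np.array([0.0, 3.0, -1.0])]
K = gram_schmidt(B)
# K[1] gives (-3/2, 3/2, 0) and K[2] gives (4/3, 4/3, -4/3), as above

# every pair in K is orthogonal
assert all(abs(K[i] @ K[j]) < 1e-12 for i in range(3) for j in range(i + 1, 3))
```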

Each vector in the orthogonal basis $K$ can be normalized to have length $1$, making it an orthonormal basis

A family of vectors in $\mathbb{R}^n$ is orthonormal if they are mutually orthogonal and all have length 1.
In other words, $\{\vec{\beta}_1,...,\vec{\beta}_l\}\subseteq\mathbb{R}^n$ is orthonormal if $\vec{\beta}_i\cdot\vec{\beta}_j=0$ for all $i,j\in\{1,...,l\}$ with $i<j$, and $\vec{\beta}_i\cdot\vec{\beta}_i=1$ for all $i$
If it is also a basis, then it is an orthonormal basis

Summary of Gram-Schmidt process

If $B_M=\langle\vec{b}_1,...,\vec{b}_k\rangle$ is an orthonormal basis for a subspace $M$, then every $\vec{v}\in M$ satisfies $\vec{v}=(\vec{v}\cdot\vec{b}_1)\vec{b}_1+\cdots+(\vec{v}\cdot\vec{b}_k)\vec{b}_k$

Proof Since $B_M$ is a basis for $M$, we can write $\vec{v}=c_1\vec{b}_1+\cdots+c_k\vec{b}_k$ with $c_1,...,c_k\in\mathbb{R}$. To find $c_i$, take the dot product with $\vec{b}_i$:
$$\begin{array}{lcl}\vec{v}\cdot\vec{b}_i&=&(c_1\vec{b}_1+\cdots+c_i\vec{b}_i+\cdots+c_k\vec{b}_k)\cdot\vec{b}_i\\&=&c_1(\vec{b}_1\cdot\vec{b}_i)+\cdots+c_i(\vec{b}_i\cdot\vec{b}_i)+\cdots+c_k(\vec{b}_k\cdot\vec{b}_i)\\&=&c_i\end{array}$$
since $\vec{b}_i\cdot\vec{b}_j=0$ for $i\ne j$ and $\vec{b}_i\cdot\vec{b}_i=1$
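This fact — that in an orthonormal basis each coordinate is a single dot product — can be illustrated numerically (a small sketch with NumPy; the subspace and basis below are assumptions for the example, not from the text):

```python
import numpy as np

# an orthonormal basis for a 2-dimensional subspace M of R^3
b1 = np.array([1.0, 0.0, 0.0])
b2 = np.array([0.0, 1.0, 1.0]) / np.sqrt(2)

v = 2 * b1 + 3 * b2  # a vector in M with coordinates (2, 3) in this basis

# each coordinate is recovered as a dot product with the basis vector
c1, c2 = v @ b1, v @ b2
assert np.isclose(c1, 2.0) and np.isclose(c2, 3.0)
```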

We will say $\vec{w}\in\mathbb{R}^n$ is orthogonal to a subspace $M$ of $\mathbb{R}^n$ if it is orthogonal to every vector $\vec{v}\in M$, i.e. $\vec{w}\cdot\vec{v}=0$ for all $\vec{v}\in M$
a) The only vector $\vec{v}\in M$ that is orthogonal to $M$ is $\vec{0}$
b) If $\vec{w}_1$ and $\vec{w}_2$ are orthogonal to $M$, then any $c_1\vec{w}_1+c_2\vec{w}_2$ with $c_1,c_2\in\mathbb{R}$ is also orthogonal to $M$
c) If $B_M=\langle\vec{\beta}_1,...,\vec{\beta}_k\rangle$ is a basis for $M$, then $\vec{w}$ is orthogonal to $M$ iff $\vec{w}\cdot\vec{\beta}_i=0$ for all $i=1,...,k$

Proofs

a) We must have $\vec{v}$ orthogonal to itself, so
$$\vec{v}\cdot\vec{v}=|\vec{v}|^2=0\implies \vec{v}=\vec{0}$$
b) We have $\vec{w}_1\cdot\vec{v}=0$ and $\vec{w}_2\cdot\vec{v}=0$ for all $\vec{v}\in M$, so
$$(c_1\vec{w}_1+c_2\vec{w}_2)\cdot\vec{v}=c_1\vec{w}_1\cdot\vec{v}+c_2\vec{w}_2\cdot\vec{v}=0$$
c) If $\vec{w}\in\mathbb{R}^n$ is orthogonal to $M$, then it is orthogonal to every $\vec{\beta}_i\in M$. Conversely, assume $\vec{w}\in\mathbb{R}^n$ is such that $\vec{w}\cdot\vec{\beta}_i=0$ for all $i=1,...,k$.
Any vector $\vec{v}\in M$ can be represented as $\vec{v}=c_1\vec{\beta}_1+\cdots+c_k\vec{\beta}_k$, so
$$\vec{w}\cdot\vec{v}=\vec{w}\cdot(c_1\vec{\beta}_1+\cdots+c_k\vec{\beta}_k)=c_1\vec{w}\cdot\vec{\beta}_1+\cdots+c_k\vec{w}\cdot\vec{\beta}_k=0$$


Onto a Subspace

This is a generalization of the projection onto a line.

Let $M$ be a subspace of $\mathbb{R}^n$. Then for every vector $\vec{w}\in\mathbb{R}^n$, there exists a unique vector $\vec{v}\in M$ such that $\vec{w}-\vec{v}$ is orthogonal to $M$.
We denote $\vec{v}=\text{proj}_M(\vec{w})$ and call it the orthogonal projection of $\vec{w}$ on $M$.
If $B_M=\langle\vec{b}_1,...,\vec{b}_k\rangle$ is an orthonormal basis for $M$, then
$$\text{proj}_M(\vec{w})=(\vec{w}\cdot\vec{b}_1)\vec{b}_1+\cdots+(\vec{w}\cdot\vec{b}_k)\vec{b}_k$$

Proof

We claim the vector $\vec{v}=(\vec{w}\cdot\vec{b}_1)\vec{b}_1+\cdots+(\vec{w}\cdot\vec{b}_k)\vec{b}_k$ is such that $\vec{w}-\vec{v}$ is orthogonal to $M$. Since $\vec{v}\in M$ and $B_M$ is an orthonormal basis, $\vec{v}=(\vec{v}\cdot\vec{b}_1)\vec{b}_1+\cdots+(\vec{v}\cdot\vec{b}_k)\vec{b}_k$.
Therefore,
$$\vec{v}\cdot\vec{b}_1=\vec{w}\cdot\vec{b}_1,\ ...,\ \vec{v}\cdot\vec{b}_k=\vec{w}\cdot\vec{b}_k$$
This implies $(\vec{w}-\vec{v})\cdot\vec{b}_i=0$ for all $i=1,...,k$, so by c) from before $\vec{w}-\vec{v}$ is orthogonal to $M$.
Now suppose $\vec{v}_1,\vec{v}_2\in M$ are such that $\vec{w}-\vec{v}_1$ and $\vec{w}-\vec{v}_2$ are both orthogonal to $M$. By b) from before, $(\vec{w}-\vec{v}_1)-(\vec{w}-\vec{v}_2)=\vec{v}_2-\vec{v}_1$ is orthogonal to $M$, but $\vec{v}_2-\vec{v}_1\in M$, so by a) $\vec{v}_2-\vec{v}_1=\vec{0}\implies\vec{v}_2=\vec{v}_1$,
proving uniqueness
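With an orthonormal basis in hand, the projection is a direct sum of dot products. A sketch assuming NumPy (the plane $x+z=0$ and its basis are chosen for illustration):

```python
import numpy as np

# orthonormal basis for M = {(x, y, z) | x + z = 0}
b1 = np.array([0.0, 1.0, 0.0])
b2 = np.array([1.0, 0.0, -1.0]) / np.sqrt(2)

def proj_M(w):
    """proj_M(w) = (w.b1) b1 + (w.b2) b2 for the orthonormal basis above."""
    return (w @ b1) * b1 + (w @ b2) * b2

w = np.array([1.0, -1.0, 1.0])
p = proj_M(w)  # gives (0, -1, 0)

# w - p is orthogonal to both basis vectors, hence to all of M
assert abs((w - p) @ b1) < 1e-12 and abs((w - p) @ b2) < 1e-12
```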

Let $M$ be a subspace of $\mathbb{R}^n$. The map $\text{proj}_M:\mathbb{R}^n\to M,\ \vec{w}\mapsto\text{proj}_M(\vec{w})$ is a linear map.

Proof

We must show that for $\vec{w}_1,\vec{w}_2\in\mathbb{R}^n$ and $c_1,c_2\in\mathbb{R}$,
$$\text{proj}_M(c_1\vec{w}_1+c_2\vec{w}_2)=c_1\text{proj}_M(\vec{w}_1)+c_2\text{proj}_M(\vec{w}_2)$$
Both $\vec{w}_1-\text{proj}_M(\vec{w}_1)$ and $\vec{w}_2-\text{proj}_M(\vec{w}_2)$ are orthogonal to $M$. Therefore, the linear combination
$$c_1(\vec{w}_1-\text{proj}_M(\vec{w}_1))+c_2(\vec{w}_2-\text{proj}_M(\vec{w}_2))=(c_1\vec{w}_1+c_2\vec{w}_2)-(c_1\text{proj}_M(\vec{w}_1)+c_2\text{proj}_M(\vec{w}_2))$$
is also orthogonal to $M$.
Since $c_1\text{proj}_M(\vec{w}_1)+c_2\text{proj}_M(\vec{w}_2)\in M$, by uniqueness of the projection we must have
$$c_1\text{proj}_M(\vec{w}_1)+c_2\text{proj}_M(\vec{w}_2)=\text{proj}_M(c_1\vec{w}_1+c_2\vec{w}_2)$$


The orthogonal complement of a subspace $M$ of $\mathbb{R}^n$ is
$$M^{\perp}=\{\vec{w}\in\mathbb{R}^n\ |\ \vec{w}\text{ is orthogonal to }M\}$$
(read "$M$ perp")

Example 12.4

Find the orthogonal complement of the plane in $\mathbb{R}^3$
$$P=\{\begin{pmatrix}x\\y\\z\end{pmatrix}\ |\ 3x+2y-z=0\}$$


First, find a basis for $P$:
$$B=\langle\begin{pmatrix}1\\0\\3\end{pmatrix},\begin{pmatrix}0\\1\\2\end{pmatrix}\rangle$$

Steps We have $z=3x+2y$, so $P=\left\{\begin{pmatrix}1\\0\\3\end{pmatrix}x+\begin{pmatrix}0\\1\\2\end{pmatrix}y\ |\ x,y\in\mathbb{R}\right\}$

A $\vec{v}$ that is orthogonal to every vector in $B$ is orthogonal to every vector in $\text{span}(B)=P$
So this gives two conditions:
$$\begin{array}{cc}\begin{pmatrix}1\\0\\3\end{pmatrix}\cdot\begin{pmatrix}v_1\\v_2\\v_3\end{pmatrix}=0&\begin{pmatrix}0\\1\\2\end{pmatrix}\cdot\begin{pmatrix}v_1\\v_2\\v_3\end{pmatrix}=0\end{array}$$
This gives a linear system
$$P^{\perp}=\{\begin{pmatrix}v_1\\v_2\\v_3\end{pmatrix}\ |\ \begin{pmatrix}1&0&3\\0&1&2\end{pmatrix}\begin{pmatrix}v_1\\v_2\\v_3\end{pmatrix}=\begin{pmatrix}0\\0\end{pmatrix}\}$$
We therefore must find the nullspace of the matrix; with $v_3$ free, $v_1=-3v_3$ and $v_2=-2v_3$, so
$$P^\perp=\{\begin{pmatrix}-3\\-2\\1\end{pmatrix}t\ |\ t\in\mathbb{R}\}$$
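Numerically, $P^\perp$ is the null space of that coefficient matrix; SciPy's `null_space` computes it directly (a sketch assuming SciPy is available):

```python
import numpy as np
from scipy.linalg import null_space

# rows are the basis vectors of P, so the null space is P-perp
A = np.array([[1.0, 0.0, 3.0],
              [0.0, 1.0, 2.0]])
N = null_space(A)        # one orthonormal column spanning P-perp
d = N[:, 0] / N[2, 0]    # rescale so the last entry is 1
# d gives (-3, -2, 1), matching the result above
```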

For a subspace $M$ and its orthogonal complement $M^\perp$,

  1. $M^\perp$ is itself a subspace
  2. $M\cap M^\perp=\{\vec{0}\}$
  3. For every $\vec{w}\in\mathbb{R}^n$, $\vec{w}-\text{proj}_M(\vec{w})\in M^\perp$
  4. The span of $M^\perp\cup M$ is all of $\mathbb{R}^n$
  5. If $\text{dimension}(M)=k$, then $\text{dimension}(M^\perp)=n-k$
Proofs
  1. $\vec{0}\in M$ and $\vec{0}\cdot\vec{v}=0$ for all $\vec{v}\in M$, so $\vec{0}\in M^\perp$ as well.
    From b) from before, $M^\perp$ is closed under vector addition and scalar multiplication. Thus, $M^\perp$ is a subspace of $\mathbb{R}^n$

  2. From a) from before, the only vector in $M$ that is orthogonal to $M$ is $\vec{0}$, so $M\cap M^\perp=\{\vec{0}\}$

  3. By definition of $\text{proj}_M(\vec{w})$, $\vec{w}-\text{proj}_M(\vec{w})$ is orthogonal to $M$, so $\vec{w}-\text{proj}_M(\vec{w})\in M^\perp$

  4. For any $\vec{w}\in\mathbb{R}^n$, we have $\vec{w}=(\vec{w}-\text{proj}_M(\vec{w}))+\text{proj}_M(\vec{w})$,
    where $\vec{w}-\text{proj}_M(\vec{w})\in M^\perp$ and $\text{proj}_M(\vec{w})\in M$

  5. First, suppose $\text{dimension}(M^\perp)=l$.
    Then choose orthonormal bases $B_M=\langle\vec{b}_1,...,\vec{b}_k\rangle$ of $M$ and $B_{M^\perp}=\langle\vec{b}_{k+1},...,\vec{b}_{k+l}\rangle$ of $M^\perp$.
    $B_M$ spans $M$ and $B_{M^\perp}$ spans $M^\perp$, so $\langle\vec{b}_1,...,\vec{b}_k,\vec{b}_{k+1},...,\vec{b}_{k+l}\rangle$ spans $\mathbb{R}^n$ by (4).
    We consider $\vec{b}_i\cdot\vec{b}_j$ for $i<j$:
    If $j\le k$, since $\langle\vec{b}_1,...,\vec{b}_k\rangle$ is orthonormal, $\vec{b}_i\cdot\vec{b}_j=0$.
    If $k+1\le i$, since $\langle\vec{b}_{k+1},...,\vec{b}_{k+l}\rangle$ is orthonormal, $\vec{b}_i\cdot\vec{b}_j=0$.
    If $i\le k$ and $k+1\le j$, then $\vec{b}_i\in M$ and $\vec{b}_j\in M^\perp$, so they are perpendicular, thus $\vec{b}_i\cdot\vec{b}_j=0$.
    So the family $\{\vec{b}_1,...,\vec{b}_k,\vec{b}_{k+1},...,\vec{b}_{k+l}\}$ is mutually orthogonal and nonzero, hence linearly independent, and it spans $\mathbb{R}^n$. Therefore $k+l=n\implies l=n-k$, finishing the proof.

If $M$ is a subspace of $\mathbb{R}^n$, then $M$ is the orthogonal complement of $M^\perp$, i.e. $(M^\perp)^\perp=M$
For every $\vec{w}\in\mathbb{R}^n$,
$$\vec{w}=\text{proj}_M(\vec{w})+\text{proj}_{M^\perp}(\vec{w})$$

Proof

From the definition of $M^\perp$, if $\vec{v}\in M$ then $\vec{v}$ is orthogonal to every vector in $M^\perp$, so $\vec{v}\in(M^\perp)^\perp$, and $M\subseteq(M^\perp)^\perp$.
Furthermore, we know that $\text{dimension}(M)+\text{dimension}(M^\perp)=n$ and $\text{dimension}(M^\perp)+\text{dimension}((M^\perp)^\perp)=n$, so $\text{dimension}(M)=\text{dimension}((M^\perp)^\perp)$.
With those two facts, we can conclude that $M=(M^\perp)^\perp$.
For the second part, define $\vec{w}^\perp=\vec{w}-\text{proj}_M(\vec{w})$, which lies in $M^\perp$ by (3). Since $\vec{w}-\vec{w}^\perp=\text{proj}_M(\vec{w})\in M=(M^\perp)^\perp$, the difference $\vec{w}-\vec{w}^\perp$ is orthogonal to $M^\perp$; together with $\vec{w}^\perp\in M^\perp$, the uniqueness of the projection gives $\vec{w}^\perp=\text{proj}_{M^\perp}(\vec{w})$.
Finally, $\vec{w}=\vec{w}^\perp+\text{proj}_M(\vec{w})=\text{proj}_{M^\perp}(\vec{w})+\text{proj}_M(\vec{w})$

Given a subspace $M\subseteq\mathbb{R}^n$, how can we compute $\text{proj}_M(\vec{w})$ of a vector $\vec{w}\in\mathbb{R}^n$?
We will suppose the basis for $M$ is $B=\langle\vec{b}_1,...,\vec{b}_k\rangle$.
If $B$ is an orthonormal basis, then we know
$$\text{proj}_M(\vec{w})=(\vec{w}\cdot\vec{b}_1)\vec{b}_1+\cdots+(\vec{w}\cdot\vec{b}_k)\vec{b}_k=UU^T\vec{w}$$
where $U$ is the $n\times k$ matrix whose columns are $\vec{b}_1,...,\vec{b}_k$, or equivalently
$$\text{Rep}_{B}(\text{proj}_M(\vec{w}))=\begin{pmatrix}\vec{w}\cdot\vec{b}_1\\\vdots\\\vec{w}\cdot\vec{b}_k\end{pmatrix}$$

If $B$ is merely an orthogonal basis, then
$$\langle\frac{\vec{b}_1}{|\vec{b}_1|},...,\frac{\vec{b}_k}{|\vec{b}_k|}\rangle$$
is orthonormal.

If $B$ isn't orthogonal, you could use Gram-Schmidt, but we use a more convenient formula:

Let $M\subseteq\mathbb{R}^n$ be a subspace with basis $\langle\vec{b}_1,...,\vec{b}_k\rangle$ and let $A$ be the matrix whose columns are the $\vec{b}_i$'s. Then
$$\text{proj}_M(\vec{v})=c_1\vec{b}_1+\cdots+c_k\vec{b}_k$$
where the $c_i$'s are the entries of the vector
$$(A^TA)^{-1}A^T\cdot\vec{v}$$
or equivalently,
$$\text{proj}_M(\vec{v})=A(A^TA)^{-1}A^T\cdot\vec{v}$$

Proof

Given: $\langle\vec{b}_1,...,\vec{b}_k\rangle$ is the basis of $M\subseteq\mathbb{R}^n$ and $A$ is an $n\times k$ matrix with column $i$ being $\vec{b}_i$.
$\text{proj}_M(\vec{v})\in M\implies\text{proj}_M(\vec{v})=c_1\vec{b}_1+\cdots+c_k\vec{b}_k=A\vec{c}$ where $\vec{c}=\begin{pmatrix}c_1\\\vdots\\c_k\end{pmatrix}$
$\vec{v}-\text{proj}_M(\vec{v})$ is orthogonal to every $\vec{b}_i$, i.e. to every row of $A^T$, so $A^T(\vec{v}-\text{proj}_M(\vec{v}))=\vec{0}$
$\implies A^T(\vec{v}-A\vec{c})=A^T\vec{v}-A^TA\vec{c}=\vec{0}\implies\vec{c}=(A^TA)^{-1}A^T\vec{v}$ ($A^TA$ is invertible because the columns of $A$ are linearly independent)
Thus, $\text{proj}_M(\vec{v})=A\vec{c}=A(A^TA)^{-1}A^T\vec{v}$

Note that $(A^TA)^{-1}\ne A^{-1}(A^T)^{-1}$ because $A$ is not square

Example 12.5

Project $\vec{v}=\begin{pmatrix}1\\-1\\1\end{pmatrix}$ onto the plane $P=\{\begin{pmatrix}x\\y\\z\end{pmatrix}\ |\ x+z=0\}$


A basis for $P$ is $\langle\begin{pmatrix}0\\1\\0\end{pmatrix},\begin{pmatrix}1\\0\\-1\end{pmatrix}\rangle$ so
$$A=\begin{pmatrix}0&1\\1&0\\0&-1\end{pmatrix}\quad A^T=\begin{pmatrix}0&1&0\\1&0&-1\end{pmatrix}$$
Now, we simply compute $A(A^TA)^{-1}A^T\vec{v}$:
$$A^TA=\begin{pmatrix}1&0\\0&2\end{pmatrix}$$
$$(A^TA)^{-1}=\begin{pmatrix}1&0\\0&1/2\end{pmatrix}$$
$$(A^TA)^{-1}A^T=\begin{pmatrix}0&1&0\\1/2&0&-1/2\end{pmatrix}$$
$$A(A^TA)^{-1}A^T=\begin{pmatrix}1/2&0&-1/2\\0&1&0\\-1/2&0&1/2\end{pmatrix}$$
Finally,
$$\text{proj}_P(\vec{v})=\begin{pmatrix}1/2&0&-1/2\\0&1&0\\-1/2&0&1/2\end{pmatrix}\begin{pmatrix}1\\-1\\1\end{pmatrix}=\begin{pmatrix}0\\-1\\0\end{pmatrix}$$
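The same computation in NumPy (a sketch; `proj_matrix` is a hypothetical helper name, not from the text):

```python
import numpy as np

def proj_matrix(A):
    """Projection matrix A (A^T A)^{-1} A^T onto the column space of A."""
    return A @ np.linalg.inv(A.T @ A) @ A.T

# Example 12.5: columns of A are a basis for the plane x + z = 0
A = np.array([[0.0, 1.0],
              [1.0, 0.0],
              [0.0, -1.0]])
P = proj_matrix(A)
v = np.array([1.0, -1.0, 1.0])
p = P @ v  # gives (0, -1, 0)

# any orthogonal projection matrix is idempotent and symmetric
assert np.allclose(P @ P, P) and np.allclose(P, P.T)
```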

Given a subspace $M\subseteq\mathbb{R}^n$, the distance from $\vec{w}\in\mathbb{R}^n$ to $M$ is the smallest possible distance from $\vec{w}$ to a point of $M$.
The distance from $\vec{w}$ to $M$ is $|\vec{w}-\text{proj}_M(\vec{w})|$; equivalently, $|\vec{w}-\vec{v}|\ge|\vec{w}-\text{proj}_M(\vec{w})|$ for all $\vec{v}\in M$

Proof

We know $\vec{w}=\text{proj}_M(\vec{w})+\text{proj}_{M^\perp}(\vec{w})$
$\implies\vec{w}-\vec{v}=(\text{proj}_M(\vec{w})-\vec{v})+\text{proj}_{M^\perp}(\vec{w})$
Since $\vec{v}\in M$, $\text{proj}_M(\vec{w})-\vec{v}\in M$, so it is orthogonal to $\vec{w}-\text{proj}_M(\vec{w})=\text{proj}_{M^\perp}(\vec{w})\in M^\perp$. Therefore, by the Pythagorean Theorem,
$$|\vec{w}-\vec{v}|^2=|\text{proj}_M(\vec{w})-\vec{v}|^2+|\vec{w}-\text{proj}_M(\vec{w})|^2$$
So $|\vec{w}-\vec{v}|\ge|\vec{w}-\text{proj}_M(\vec{w})|$, with equality exactly when $\vec{v}=\text{proj}_M(\vec{w})$
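Putting the distance formula to work numerically (a sketch reusing the plane from Example 12.5; `dist_to_subspace` is a hypothetical name):

```python
import numpy as np

def dist_to_subspace(w, A):
    """Distance |w - proj_M(w)| where M is the column space of A."""
    P = A @ np.linalg.inv(A.T @ A) @ A.T  # projection matrix onto M
    return np.linalg.norm(w - P @ w)

# plane x + z = 0, basis vectors as columns (as in Example 12.5)
A = np.array([[0.0, 1.0],
              [1.0, 0.0],
              [0.0, -1.0]])
w = np.array([1.0, -1.0, 1.0])
d = dist_to_subspace(w, A)  # proj is (0, -1, 0), residual (1, 0, 1), so d = sqrt(2)
```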